A Deterministic Word Dependency Analyzer Enhanced With Preference Learning

نویسندگان

  • Hideki Isozaki
  • Hideto Kazawa
  • Tsutomu Hirao
چکیده

Word dependency is important in parsing technology. Some applications such as Information Extraction from biological documents benefit from word dependency analysis even without phrase labels. Therefore, we expect an accurate dependency analyzer trainable without using phrase labels is useful. Although such an English word dependency analyzer was proposed by Yamada and Matsumoto, its accuracy is lower than state-of-the-art phrase structure parsers because of the lack of top-down information given by phrase labels. This paper shows that the dependency analyzer can be improved by introducing a Root-Node Finder and a Prepositional-Phrase Attachment Resolver. Experimental results show that these modules based on Preference Learning give better scores than Collins’ Model 3 parser for these subproblems. We expect this method is also applicable to phrase structure parsers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Desarrollo de un Analizador Sintáctico Estadístico basado en Dependencias para el Euskera

This paper presents the first steps towards a statistical syntactic analyzer for Basque. The system is based on a syntactically dependency annotated treebank and an adaptation of the deterministic syntactic analyzer of Nivre et al. (2007), which relies on a shift/reduce deterministic analyzer together with a machine learning module that determines which one of 4 analysis options to take, giving...

متن کامل

Japanese Dependency Parsing Using a Tournament Model

In Japanese dependency parsing, Kudo’s relative preference-based method (Kudo and Matsumoto, 2005) outperforms both deterministic and probabilistic CKY-based parsing methods. In Kudo’s method, for each dependent word (or chunk) a loglinear model estimates relative preference of all other candidate words (or chunks) for being as its head. This cannot be considered in the deterministic parsing me...

متن کامل

Machine Learning-based Dependency Analyzer for Chinese

In this paper, we present a deterministic dependency structure analyzer for Chinese. This analyzer implements two algorithms – Yamada and Nivre algorithms – and two sorts of classifiers – Support Vector Machines and Maximum Entropy models. We compare the performance of these 2x2 combinations. We evaluate the methods on a dependency tagged corpus derived from the CKIP Treebank corpus. Then, we a...

متن کامل

Chinese Deterministic Dependency Analyzer: Examining Effects of Global Features and Root Node Finder

We present a method for improving dependency structure analysis of Chinese. Our bottom-up deterministic analyzer adopt Nivre’s algorithm (Nivre and Scholz, 2004). Support Vector Machines (SVMs) are utilized to determine the word dependency relations. We find that there are two problems in our analyzer and propose two methods to solve them. One problem is that some operations cannot be solved on...

متن کامل

Exploiting Web-Derived Selectional Preference to Improve Statistical Dependency Parsing

In this paper, we present a novel approach which incorporates the web-derived selectional preferences to improve statistical dependency parsing. Conventional selectional preference learning methods have usually focused on word-to-class relations, e.g., a verb selects as its subject a given nominal class. This paper extends previous work to wordto-word selectional preferences by using webscale d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004